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(57) Abstract 

The present invention presents a method for paititioning that provides 
both a relevant metric and a set of clusters ftrougb an evohitionary learning 
process. The present invention further presents a method for determining 
consumer demand (304) tliat finds the context dependent, or combinatorial 
optimized set of properties, uses, or customer features that optimize the vahie 
of a product to the customer base. Itie present invention further includes 
a framework for the mariceting and introduction of novel {Hxxlucts. The 
framework has means to model customers and derive an optimal set of goods 
(308) to produce alone or in the face of a coevolving competitive environment 
where o^er fiims are introducing and modifying their own goods. 
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A METHOD FOR PERFORMIKQ WiBKBX SE6KENTA.TZ0N J^KD 
FOR PREDICTING CONSUMER DEMAND 

FIBLD OF TBB XJSlTSSfSXOSf 

5 The preftdiit Invention relates generally to methods 

for performing market aegmentation and to methods predicting 
constimer demand* More specifically, the present invention 
performs market segmentation by determining dissimilarity 
measures and predicts consumer demand by eonstnxcting a ^ . 

10 landscape of consinner demand through focueed satnpling. 

fiACXGROUND . 

The problem of determining what combination of 
factors in a given product (from toothpastes to cookies or 

15 cars) will attract customers is a difficult one because the 
relationship between zhe factors may be highly nonlinear and 
because the ratings of factors by cus comers may be biased for 
various reasons (fozrmilation of the questions, rating scale, 
customer answers do not reflect accual preferences, etc.) . 

30 por marlceting purposes, a company would like to be able to 
find clusters coK^sed of a sufficient number of customers 
wiwh similar preferences in order co either launch a new 
product, or adjust an existing product, adapted to such 
preferences. Of critical importance is that customers with 

25 similar sets of preferences ba assigned to the same cluster 
and that ciistomers with significantly different sets of 
preferences be assigned to different clusters. 

Clustering algorithms exist that can generate 
clusters that satisfy either one or both of these 

30 constraints. Multidimensional scaling methods go one step 
further to allow visualization of high- dimensional data 
clusters in a low -dimensional embedding space. But 
clustering algorithms and multidimensional scaling methods 
always assume the existence of a well-defined metric or 

35 dissimilarity measure in attribute space, here the space of 
factors that contribute to a product. 
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Accordingly, there exists a need for a method for 
partitioning that provides both a relevant metric and a set 
of clusters. 

Mext, a large body of mathematical and statistical 
5 work exiats concerning mean3 to estimate the optimal 

composition of a good or service for a given customer, or 
population of customers. This body of work contains 
techniques, known in the art, such as CART, and discrete 
choice providing means for determining utility functions over 
j^O a apace of properties of a good or service for a given 
consumer, as well as means of considering a population of 
different customers with different preferences over that 
space of properties and attempting to "segment" the customer 
population into subgroups which may than be specifically 
15 targeted by marketing, or shifting the property mix of each 
product produced and the vector of products to "match" 
optimally the customer population. Typically the aim is to 
maximize profit for the firm. 
25 The means in the art, in general, atternpt to fit 

20 the observed data points by building up sketches of the 
utility surface for a given consumer or class of consumers 
using, in the simplest case, linear regression of the data 
points on all the property axes. Different classes of 
consumers are discriminated by discovering different linear 
25 regression patterns for different, e.g., demographic classes. 
In more sophisticated approaches, attempts are made to model 
the possibly curved properties of Usoultility" surfaces in 
^5 the space of properties for a given consumer, or class of 

consumers, by fitting higher order polynomials to the sampled 
30 data. The generic problem with this approach is that the data 
sampled must be used ro estimate the coefficients of the 
monomial and polynomial terms, and the finite amount of data 
is typically used to characterize the lowest order terms, 
monomial, (Quadratic, etc, first. The data is typically "used 
35 up in obtaining reliable statistical estimates of these low 
order terms, and little or none is left over for use 
estimating higher order terms. 
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on the other hand, the higher order terxtui are 
precisely the measures of the c«nplex context dependent 
tnteractlons among properties of a single good.or service or 
a collection of goods or services. A trivial example is 
, breakfast, consisting of ham and eggs. These two are 

traditionally called -consumption complements" by economists. 
The utility of ham for many consumers is much higher m the 
presence of eggs, than alone,- so too with eggs. Another case 
is niche marketing of cellular telephones. Not only are these 
. n phones of Interest . to high volume users with expense 

accounts, but to low volume users who happen to be women with 
small children driving in rural areas and worried about an 
accident and no means of calling for help. The context 
dependence of the properties: woman, mother, with child in 
15 car. accident and safety demonstrates the comblnatorxal 
character of one niche occupied by this product. 

Oa a siirpier level, a given product, say soap, 
might be characterized by a number of features = color, shape, 
smell, saponin content, mass. etc. Or coca-cola packaging may 
,0 be characterised by a nunier of properties, number of cans, 
size of cans, fluid in can. color of package, etc. 

Accordingly, there is also a need for a method for 
determining consumer den^nd that finds the context dependent, 
or combinatorial optimized set o£ properties, uses, or 
25 customer features that optimize the value of a product to the 
customer base. 



SOMMARy OF THE IHVEHTIOH 
The present invention presents a method for 

partitioning that provides both a relevant metric and a set 

of clusters through an evolutionary learning process. 

It is an aspect of the present invention to present 

a method for partitioning a space of data comprising the 

steps of: 

choosing a plurality of dissimilarity measures, 



10 



15 



WO 00/02138 PCT/US99/1 5236 

partitioning the space for each of said plurality 
of dissimilarity measures; 

evaluating said partitioning for each of said 
plurality of dissimilarity measures; and 
g selecting one or more of said dissimilarity 

measures on the basis of said evaluation. 

The present Invention further presents a method for 
determining conDumer demand that finds the context dependent, 
or combinatorial optimized set of properties, uses, or 
10 customer features that optimize the value of a product to Che 

customer base. 

It is an aspect of the present invention to present 
a method for determining customer demand for products 
20 con5)rising the steps of: 

defining a space having R dimensions wherein each 
point in said space corresponds to a vector of properties; 

constructing a landscape for said space comprising 
the steps of i 

locating at least one point on said space 
20 where a predetermined customer would purchase a product 
having said corresponding vector of properties at a 
predetermined price; and 
30 sampling a set of points on an R-dimsnsional 

sphere surrounding said selected point at a predetermined 
25 step length from said selected point to determine a first 
subset of said set of points where the predetermined customer 
would make a purchase at said predetermined price and to 
determine a second subset of said sampled points where the 
customers would not make the purchase at said predetermined 
30 price, said first subset of points and said second subset of 
points form at least one indifference surface between a 
buying region and a non-buying region at said predetermined 
price. 

The present invention further includes a framework 
35 for the marketing and introduction of novel products. The 
framework has means to model customers and to derive an 
optimal set of goods to produce alone or in the face of a 
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coevolving competitive environment where other fix.s are 
introducing and nidifying their own 

It is a further aspect of the present inve 

evaluating said plurality of candidate maps and 
,0 said family of agent-based models with respect to aaid 
eon.u.er data, ^^^^ ^^^^^^^^^ 

„ndidate and^aid family of a.ent ha.ed models ha.ed on 

^"i::: at least one operation on .aid selected 
candidate maps and said selected agent-based ^^^^ 
Tnera" a new plurality of candidate ««ps and a new family 
of agent-baaed nusdcls. 

BMEP DBSCRtPTIOS OF THS DRAWINGS 
^° FIG 1 provides a flow diagram of the adaptive 

dissimilarity partitioning method 100 of the present 

. provides a flow diagram of a method for 
„ determining consumer demand 200 that finds the context 
" recent, or combinatorial optimized set of propert.ee 
ue'es. or customer features that optimize the value of a 
,.oduct to^the ,_wor. 300 

^ marketinci and introduction of novel products. 

30 .or the -rketlng^^^^^^^^ ^ ^^^^ ^^^^^^^ 

creating a model of customer preferences, 
creating a^^^ ^ ^^^^^^^^^ ^ representative computer system 

in conjunction with which the enO^odiments of the present 
35 invention may be implemented. 
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OBTAINED DBSCRIPTIOM OF THE mPERRED EMB0PIM8NT 
The present invention presents methods for 
partitioning that provide both a relevant metric and a set of 
thUh an evolutionary 

ntusrative context of n«rket segmentation. However, it 
illustrative co» aklll in the art that 

will be apparent to persons of ordinary skill m cn 
the aspects o£ the embodiments of the Invention are also 
'° ap 1 cable in any context in which the natural metric ox 
dissimilarity measure of attribute space is not precisely 

Let us define a set of n m-dimensional data vectors 
V (i-l . n). The coB5)onent8 x„ {J-1 m) may be real 

" vJ ia^ies.' binary variables, or other types of variables^ 
ane aim of a typical clustering algorithm i« " J^^^ 
data points to clusters to minimize some cost '^^^^^^'r . 
proto^e vector is usually associated with each c^-^-^ ^ 
clusteVis defined as the set of data vectors that are closer 

" he Cluster's prototype than to any other prototype^ For 
example, in the k-means clustering algorithm, one has to 
::termi;e the coordinates of k prototype vectors y. (h=l. 
k) to minimize the following cost functions 



25 

a k 
M h-1 



Where m^-l if - ^ " .1 

otherwise, and \.\ is a distance in the space of data 
vectors. The kUans clustering algorithm is explained in 
sL methods, for classification and analysis of multi variate 
33 Observations, McQueen. ... P-c. Fifth Berkeley Symp - 
Mathematical Statistic, and Probability. Vol. 1 (I- Cam. U 



J., eds), untverfltty of California Press, 



M. El Neyman 
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10 



Berkeley, CA. 1965. pp. 281-297. An acceptable clustering 
solution is given by where each data vector is assigned 

to one and only one cluster, in the k-means algorithm, the 
cluster prototypes are initialized with the first k data 
vectors. A new data vector x,, i>k. is assigned to the 
closest prototype vector yMi,. ShB prototype is adjusted in 
response to x„ or, more precisely, is moved closer to x^. 



15 



20 



1 

0-1 

The total adjustment of Che prototype is normalized to the 
nuirher of vectors that have already been assigned to that 
prototype. A randomized version of this algorithm, 
2^ supplemented with topological constraints on prototypes, is 

20 the self-crganizing map, an unsupervised neural network. 
Unsupervised neural networks are explained in The self- 
organizing Map, Kohonen. T., 1990, the contents of which are 
herein incorporated by reference. We have assumed the 
existence of a well-defined distance |.J . Sometimes, only 
25 pairwise (or higher-order) relationships among vector 

components are available. In such cases, the cost function to 
be minimized is 'the product of the dissimilarities of data 
vectors assigned to the same cluster. 

Multidimensional scaling (MDS) is used to represent 
30 data points in a two-or three-dimensional Euclidian space 
such that pairwise distances in representation space closely 
40 match pairwise dissimilarities as explained in 

mitidimenaional Scaling, Cox, T. F. & Cox, M. A. A. , Chapman 
& Hall, London, 1994 { «Jtful tldiznenslonal Scaling") » the 
35 contents of which are herein incorporated by reference. A 
clustering algorithm can be applied to the representation 
Vectors. Let y^ be the vector that represents data vector x^. 
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Let dm be the diBtance between two representation vectors 
and K = *nd D,, the dissimilarity between x, and 

x„. The coBt function {also called stress) to be minimized 
19 typically given byi 

5 

W t»-l 
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10 

where the weights W|„ are Introduced to normalize the 
absolute values of the disparities D^. A common choice for w^^ 
• is 



15 L 



tt'\ M 



■ 20 Other definitions of stress and algorithms for minimizing 
screes are surveyed by ftful tidiraensional Scaling. 

In both clustering and MDS, the initial 
dissimilarity measure is assumed to be Jcnown. Given the 
dissimilarity, a clustering algorithm provides clusters 
25 whereas MDS provides a low-dimenaional representation. The 
obtained clusters or representations critically depend on the 
choice of the dissimilarity measure. Such a measure is 
usually defined on the basis of "intuitive" criteria and 
relies on the "expertise" of the designer. Defining a 
30 dissimilarity measure, however, can in principle be 
automated. Clustering or scaling data, although it is 
40 sometimes used for exploratory data analysis, is usually a 

first »'preprocessing« step in a particular task to be 
performed (compression, understanding, market segmentation, 
35 etc.) . The performance of clustering or MDS can therefore be 
measured not only with respect to the coat function or stress 
to be minimized but also in connection with the task to be 
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5 performed. The appropriate dissimilarity measure can be 

learned in a aupervieed manner on a training set. tested on a 
validation set, and applied to new data. The proposed 
learning algorithm is a genetic algorithm. Genetic 

,0 5 algorithms are described in Genetic Algorithms in Search, 

pptlmzatlon and Machine Learning, Goldberg, D. E., Addlson- 
wesley, Reading, MA., 1989 (Genetic Algorithms in Search, 
optimization and Machine Learning) , the contents of which are 
herein incorporated by reference. 

15 pjg_ 1 provides a flow diagram of the adaptive 

dissimilarity partitioning method 100 of the present 
invention, in step 102. the method 100 chooses a family of 
of distance functions or dissimilarity measures. In step 

20 104, the method 100 randomly generates a population of 

15 dleslmilarity measures D" = {d^} or distance functions d' in 

the chosen family, where v is the index of a given 
dissimilarity measure in that population. Each "individual- 
v is encoded into a "genotype". 

In step 106, the method 100 performs clustering or 
" multidimensional scaling with a given algorithm for each 

distance function or dissimilarity measure. In step 108, the 
method 100 evaluates the performance of cluscering or 
rmltidimensional scaling and assigns fitness to every 
_g dissimilarity measure v. In step 110, the method 100 selects 
individuals on Che basis of fitness. In step 112, the method 
100 applies operators to selected individuals and pairs of 
individuals. Preferably, the operations are genetic 
operators such as mutation and crossover. 

In step 114, the method 100 determines whether the 
partitioning results are satisfactory with respect to the 
40 fitness computed in step lOB. If the partitioning results 

are not earlsfactory, control returns to step 106 to perfoim 
clustering or multidimensional scaling for each new distance 
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35 



45 



50 



33 functions or dissimilarity measure created in steps 110 and 
" 112. If the partitioning results are satisfactory, control 
proceeds to step 116 where the method 100 terminates. 
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Th6, distance function cr dissimilarity measure can 
be represented by a true function of the vectors' coordinates 
or by a set of pairwise relationships. Vfhen only pairwiee 
relationships between data vectors are available one needs to 
5 generalize the dissimilarity measure to data vectors which 
have not been presented. The 3in?>lest generalization 
procedure is to use a locally linear interpolation, using thft 
k nearest neighbors: the dissimilarity between the new vector 
V and any other vector 0 is given by the average 
10 dissimilarity between the k nearest neighbors of V and 0, 

The following exan«)le3 illustrates the operation of 
the adaptive dissimilarity partitioning method 100. Let us 
assume for definiteness that each data vector is two- 
dimensional. The two components of represent two properties 
^5 of a cookie, fcr example, sweetness and chewiyness. A set of 
n customers is asked to determine the respective levels of 
sweetness and chewyness they like in a cookie, on a scale of 
1 to 10 for each property. In addition, each customer is 
asked to tell which type of cookie he or she is currently 
20 using. Assume that k different types of cookies are 

represented. The distance function in the space of customer 
preferences is unknown. Por exanrole, one factor may be more 
important than another. A simple family of distance functions 



ist 

25 



where f^ and f, are, for example, second-degree polynomial 
functions of their variables. Each function is characterized 
by 15 parameters, the coefficients of the polynomials. The 
variations of these parameters is assumed to be restricted to 
{-10,10]. A clustering algorithm, such as k-raeans, is 
applied to the data set using this distance function. The 
fitness of a distance d"" is given by 
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where M^^ is the number of customere assigned to the satne 
5 cluster that do no buy the aame cookie type and M^, is the 
number of customere assigned to different clusters that buy 
Che same cookie type. Depending on the task at hand, these 
two types of (nismatch can be given different weights. 

The best individuals obtained after, say, 1000 
10 generations of the genetic algorithtr., correspond to distance 
functions that allow to obtain the right clusters of 
customers in the sense described above. 

The adaptive dissimilarity partitioning method 100 
of the present invention finds the natural dissimilarity 
15 measure or distance function in a space of attributes. This 
function may be unknown. Instead of resorting to ad hoc 
functions, the method eystematically generates a distance 
function adapted to the task at hand. The obtained distance 
function reflects the true structure of the space of 
20 attributes and therefore can be used, in the context of 
market segmentation, to cluster customers, extract the 
"natural" clusters in the data using a non parametric 
clustering algorithm (that is, one in which in the number of 
clusters is not predefined) , extract the effective dimension 
25 of the space of preferences, test product differentiation, 
improve positioning by product adjustment, and test potential 
new products, taking into account the cost of moving from one 
product to another or of launching a new product. 

Other significant areas of application include 
30 protein data visualization, protein function and structure 
prediction, dimensionality reduction for virus data sets, 
general classification and pattern recognition problems, and 
data visualization, including database visualization and 
navigation. 

2g In another example, two hundred two-dimensional 

data vectors were randomly generated. Let x^i and x^j be the 
X- and y-coordinates of the ith data vector, x^i and x^, are 
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dravm from a uniform random distribution on 10, U . 
assume that x,. and x„ represent customer preferences for tv«, 
selected features of a given product type, that two products 
are on the market, and that a customer i purchases 
s if and only if x..<0.5 and purchases product 2 i£ and only if 
X. »0.5. in this example, therefore, only x„ i3 relevant in 
the determination of what product is purchased by a customer 
whose preference vector is x„) . But this information is 

not known to the analyst, who simply assumes that the 
,0 relevant distance in preference space is, for example, the 
Euclidian distance. Using such a distance, the analyst will 
be unable to correctly segregate customers into two classes. 
What the algorithm has to find is the relevant distance in 
preference space that will naturally l*ad to the correct 
segregation after application of a simple clustering 
algorithm. Here we use a modified version of the k-o-^ans 
clustering algorithm with k-2. Two centroids are initially 
located at (0.5, 0.2S) and (0.5. 0.75). Ideally, after 
application of the clustering algorithm with the appropriate 
,0 distance function, the centroids should converge to (0.25, 
0.5) and (0.75, 0.5). Remember tha-. with this clustering 
algorithm a data vector belongs to the cluster whose centroid 
is closest to that data vector. Let C.,4, be the centroid 
closest to vector x. ArgMin(d{C„,x,)) , where d is the 



^' distance function) . and C,„, the jtb coordinate (j.1,2) of 
35 C.„i. The centroid update upon presentation of x, is given 

by: 

<l(CBm>Xi) 



where d is the current distance function, o() is the sign 
35 function (o(u)-*l if u>0, o(u)-l if u.O. and o-O if u-O) , 
ie a learning rate, and n=200 is the number of data vectors. 
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The family of distance function used in this example has 
three parametere: 

Where w, a, and P et0,2] . Whan w»l and a- P«2, the usual 
Buclldian distance is recovered, and when w«l and ot-p-l oj 
gets the city-block (or distance. 

This family of distance functions can easily be 
generalized to higher- dimensional spaces. For example, 1( 
us consider a D-dimenslonal epace: 



15 




20 with 
25 

where a,{p-l D) and Wp (p-1,. .,D) are 2D parameters (of 

which only 2D-1 are free parameters) that determine the 
relative importance of the pth coordinate and the amount of 
distortion along the pth coordinate, This family of 

3^ functions assumes no correlation among coordinates, which is 
certainly a limitation in certain cases. Other distance 
functions should be used in such cases. 

For the simple two-dimensional example, a simple, 
fitness -proportionate (+eliti8m) genetic algorithm (GA) was 

35 used with the following fitness function for distance d*: 
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where K», is the tiunJoer cf customers assigned to the same 
cluster that do no purchase the same cookie type and M«, is 
the number of customers assigned to different clusters that 
buy the same product. The population size was 40, the 
10 mutation rate 0.1. and crossover was replaced with averaging 
of oaraTnetere (that is, two selected individuals produce one 
offspring the parameters of which are the arithmetic average 
of its parents' parameters). After 10 generations, the OA 
finds values of the parameters that consistently produce a 
15 perfect clustering of customers after application of the 
modified k-meana algorithm. During one application (200 
iterations) of the k-means algorithm for "bad" values of the 
parameters (w.0.96. a-l.ai, U-1.77), close to the Euclidian 
distance, the centroids are unable to move to the optimal 
20 locations and remain confined in the vicinity of their 

initial values. For "good' values of the parameters found by 
the GA after 10 generations (w-1.98. a-1.67, p=0.03), the 
centroids move to the optimal locations because the distance 
function assigns almost all the weight to the x-coordinate. 
25 The OA has therefore been able to find a distance function, 
within the family of distance functions, that reflects the 
"true" structure of preference space. 

Assume now that instead of being uniformly 
distributed in 10,11 x 10,U customers form four clusters 
30 (with the same -purchase" rule: customer I purchases product 
1 if and only if x,:<0.5 and purchases product 2 if and only 
if xi- »0.5) . TWO situations can occur; the four clusters may 
discriminate along the y-axis or along the x-axis. Upon 
application of a non-parametric (number of clusters 
35 undefined) clustering or multidimensional scaling algorithm, 
45 the situation where the four clusters may discriminate along 

the y-axis should lead to the detection of 2 clusters while 
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the situation where the four clusters discriminate along the 
X-axis should lead to the discovery of 4 clusters if the 
appropriate distance function Is used. If the Euclidian 
distance function is used both situations lead to the 

5 detection of 4 clusters. A non-parametric (ant -based) 
algorithm leads to 4 clusters in both cases using the 
Buclidian distance. The same algorithm leads to 2 clusters 
when applied to the situation where the four clusters 
discriminate along the y-axie and 4 clusters in the situation 

10 where the four clusters discriminate along the x-axis. 

in an alternate embodiment, for more complicated 
problems, general function approximators such as neural 
networlcs are used instead of family of distance functions. In 
the case of neural networks connections weights are evolved 
using the genetic algorithm. 

In another alternate embodiment, GA is interactive: 
the outcome of the clustering or MDS algorithm is evaluated 
by a human observer who picks the good solutions. 

The adaptive dissimilarity method 100 is also 

20 applicable to graph partitioning. Let Ga(V,E) be a non- 
directed graph. V- {vi)ui n Is the set of n vertices and 

a subset of VxV, the set of edges, of cardinal |e|. B can be 
represented as a matrix [e^j] of edge weights, e.^ being the 
weight of edge (Vi,Vj), where e^j I'D if {vj,Vj) e E and ej^aO 

25 if (Vi,Vj) e E. The bipartitioning problem consists of 

finding 2 sets of n/2 vertices each such that the total edge 
weight between clusters is mimimal. This problem is known to 
be NP-conplete, and many heuristics have been proposed to 
find reasonably good solutions in polynomial time. The 

30 question we may ask is the following: is there a natural 
distance in connection space (where the coordinate cf a 
vertex v^ is given by e^^, j»l, . . n) such that the 
application of the k- means clustering algorithm (k»2) 
generates a good solution of the bipartitioning problem? 

35 The adaptive dissimilarity partitioning method 100 

has been tested on random graphs 0(n, c, pt. P.), where n=100 
is the total number of vertices, c=2 is the number of 
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clusters, p, is the probability that two vertices within a 
cluster are connected, and P. is the probabilicy that two 
vertices belonging to different clusters are connected. 
Edges are characterized by e,^«l if (v,,vp e E and e^.-O if 
(v.,v,) e Such graphs are convenient to test the algorithm 
because the optimal solution of bipartitioning is known when 
c.2: the optimal partitioning solution consists of having as 
many vertices as possible that belong to the same graph 
cluster allocated to the same partition cluster. Ohese graphs 
have been introduced and are used as difficult benchmark 
problems in the context of VLSI design. 

A modified version of the k-maans algorithm is 
applied. The two centroids are initially assigned random 
coordinates. Let Qmh be the centroid closest to vertex v^ 
15 (c,„ ^ ArgMin WC,,v,;;), where d is the current distance 



function), and C^^^,. the jth coordinate (j«l,...r n) of C,,,;. 
25 The centroid update upon presentation of Xi is given by: 



20 



25 

where d is the current distance function, oO is the sign 
function (a(u)=+l if u>0, c(u)«-l if u<0, and o=0 if u-0) , n 
is a learning rate, and n=200 is the number of data vectors. 
The family of distance function used in this exaitqple has 
three parameT:ers: 
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whare w e tO,ll. and a and 0 e [0,2] . When w«l, one gets 
usual distances. The first terra contains only zeroth and 
first-order relationships between the two vertices: this 

15 term is small when the two vertices are connected (Oth-order) 
and are connected to the same set of vertices (first-order) , 
The second term, which gets activated when w<l, represents 
second-order relationships between two vertices: this term is 
email when the neighbors of the two vertices have a lot of 

20 adjacent vertices in common. Such relationships may be 
important for graph partitioning, but the extent to which 
they improve the partitioning is not known. 

The fitness function used in the GA for distance d'' 

is given by; 

25 

1 



1 + + 
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35 



where E^^t is the total inter-cluster weight, and n^ is the 
number of vertices assigned to cluster 1. The |nt-^term is 
there to favor well-balanced solutions. 

The present invention further presents a method for 
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deterroinlng consumer demand that einda context aependenc, 
or combinatorial optimised set of properties, uses or 
^stomer features that optimize the value of a product to the 

customer base. . . 

Previous work has developed a general model of 
' rugged fitness landscapes called the m model as explained in 
TeOri^ins oi Order, Stuart A. ^^"7' ^t^^^ ^^^^ 
press. 1993. Chapter 2, the contents of which are herein 
incorporated by reference. The ^ ««del is ^^^jf^'^^' 
,0 At Ho.e in the Ctolver.e. Stuart A. Kauffman. Oxford 

university Press, 1995, Chapter 9, the contents of wh.ch are 
herein Incorporated by reference. 

NK landscapes are menO^ers of a still mora general 
Class of models in physics, and known in the art as P spin 
,S models. A P spin model consists of N spins, each of wn.ch 
. can take on a discrete number of values, say -1 and .1, or i 
and 0. or a.b.c.d. Each spin contributes an -energy to^he 
total energy of a system of N spins. The energy of a given 
spin configuration of the N spins is given by the ^um «f -he 
,0 energies of the K spins. Bach spin's energy 

in general, given by a sum of a monomial term which is a 
function of its own state, plus quadratic terms which are 
sums of energies .hat are functions of the states of all 
spins that influence it in pair-ise interactions, plus a 
sLilar sum of cubic terms listing all the contributions of 
all triples Of spin, of which that spin is a member, plus 
highe^ order terms. In the m model. K is the highest order 

coupling. 3pi„.gi„3 «»dels. the discrete system has 

„ a rugged -fitness- "cost" -efficiency- or -utility" landscape 
over the combinations of states of the N spins. New 
techniques have been developed to characterize a number of 
features of such landscapes. And it is these features that 
allow ready assessment of the importance of higher order, 
c-mbinatorial properties on landscape structure. These 
properties Include: 1) The number of peaks in the landscape, 
2) The expected nunu^er of steps to a peak from any given 
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5 . . . hv.^ i>indacat>e. 3) The dwindling number of 

of different p«..c3 that can be clinibed from a ^insle point on 
the landscape by adaptive walks which must proceed only 
, uphill. B) The correlation structure of the landscape wh ch 
' roughly, the correlation between fitnesses at two points 

on the landscape as a function of their distance. 

These properties of discrete landscapes, wnere the 
spins take on only discrete values, a,b,c,d. . . can be 
10 generslized to the case of continuous di,nensions. «here each 
variable is « real nu<rt>er. This continuous case, the lengths 
of walks uphill, and dwindling directions uphill must be 
parameterized by a 'step length- in the space of reasonably 
snooth hill sides, any point on the landscape that is on a 
„ hillside has the property that, for infinitesimal steps away 
from that point, half the directions are uphill ,nd half are 
downhill, only on ridges, saddles and peaks is that false- 
However, if a discrete step length, say 100 yards, is 
specified, then as a walk continues uphill and a ridge or 
saddle or oeak is approached, the 'cone" that xs still upb.ll 
will dwindle. The rate of dwindling is a measure that can be 
used to characterise the ruggedness of a continuous 
30 landscape. Thus, on NK landscapes, with K modestly large, 

the generic feature is that at every step uphill, the number 
25 of directions uphill falls by a constant fraction As 
landscape ruggedness Increases, the fraction by which tne 
35 directions uphill dwindles increases from a few percent to 

50% for fully random landscapes in the K - N 1 "random 
energy" limit. In a similar way. the rate at which the 
uphill cone decreases as walks uphill continue provides a 
measure of landscape ruggedness for continuous landscapes. 

Consider a oroduct space, without loss of 
generality taken to be soap. Features of this product were 
noted above, and in general, include other features of 
,e interest. Consider, to be. concrete and without loss of 

generality, discrete choice methods. A customer is presented 
with different choices of a bundle of properties, or vector 
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of properties. Each bundle is a point in the property space. 
A price is attached to each such point. The customer is 
asked to choose which, if any, he/she would buy. Examination 
of the vector is the property space after a finite number of 

5 such choices, reveals a price in the vicinity of those 

positions in property space at which the customer will just 
stop buying. Thus, on one side of a point on a surface in 
property space, at that price, the customer will not buy, on 
the other side of a point on a surface in property space, at 

10 that price, the customer will not buy, on the other side he 
will buy. The point in question estimates the price for that 
specific vector of properties. By sampling at many points 
for one customer, it is possible to build up a set of points 
that escinates the utility curve, or surface, in property 
space at one price for that customer, hence an indifference 
surface, and a set of such surfaces at different prices, 

For a population of customers, a population of such 
data points can be assembled. In principle, much data could 
be obtained from each customer, but typically it is only 

20 feasible to obtain a limited amount of data from a given 
customer, Typically, this is obtained over a moderate large 
region of property epace. The data points are then typically 
each labeled by a vector of demographic traits, and an 
atten^t is made using standard analysis to discriminate both 

25 high utility positions in the space of properties, and 

simultaneously the targeted demographic populations that are 
well matched to good positions in the space o£ properties in 
order to optimize the vector of goods produced, each at a 
different position in the property space, and targeted to one 

30 or more positions in th© demographic space, such that a total 
figure of merit such as total profit after total manufacture 
and sales. 

The application of landscape ideas can improve 
chege standards procedures both by directing the limited 
35 sampling chat can be done chat it helps capture higher order 
terms, or context dependent features, of these marketscapes, 
helps build statistical models of the right "equivalence 
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class" of the real marJcet Bcape, and helps build actual 
models o£ the actual marketscape. 

FIG. 2 provides a flow diagram of a method for 
determining consumer demand 200 that finds the context 
5 dependent, or combinatorial optimized set of properties, 
uses, or customer features that optimize the value of a 
product to the customer base. In step 202, the method for 
determining consumer demand 200 selects a point in property 
space that lies on a surface that divides a region where a ^ 
^5 predetermined customer would buy from a region where the 

predetermined customer would not buy. 

In step 204, the method for predicting consumer 
demand 200 samples a sec of points on an R-dimansional sphere 
20 surrounding the point selected in step 202. Step 204 

15 contrasts with previous methods for predicting consumer 

aemand that sample widely over the product space. The radius 
of the sphere is defined in a well specified way where the 
radius is defined as the "step length" on the surface. An 
exemplary distance is the Euclidian distance. With the same 
20 customer, or more generally, the same class of customers, 
step 204 characterizes for many points in the spherical 
surface surrounding the point whose price has been 
determined, whether that new point would or would not be 
purchased by the customer at the given price. Since the true 
25 price surface in the space of properties contains the first 
determined point, that price surface will, in general, pierce 
the spherical surface surrounding the point whose price is 
determined. The points on the sphere which are purchased and 
the points which are not purchased determine, in the simpleet 
30 case, a curve of points whose price is the transition between 
buying and not buying at the price. In this way, the 
neighborhood surrounding that first priced point can be 
examined . 

In step 206, the method for predicting consumer 
35 demand 200 determines whether the indifference surface has 
been substantially conflated. If the method for predicting 
consumer demand 200 determines that the indifference surface 
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has not been substantially cotipletea, control proceeds to 
step 208. In step 208, the mathoci for predicting consumer 
demand selects another point on the indifference surface from 
the transition curve determined in step 204. After step 208, 
5 control returns to step 204. Step 204 samples a set of 
points on an R-dlmensional sphere surrounding the point 
selected in step 208. In this fashion, the method for 
predicting consumer demand 200 operates to extern the 
Indifference surface at the predetermined price in any 
10 direction in x:he property space. 

The ruggedness of the indifference surface at a 
given price will show up by any oC the properties we have 
discussed. Thus, measured in property space, the 
indifference surface at a given price may have one or more 
1^5 correlation lengths in the space of properties. These 
correlation lengths, in the NK model are long, for K small, 
and short for K large. Thus, short correlation lengths 
estimate higher order couplings among the properties. The 
cone »'uphill" in property space on an indifference surface at 
20 3 given price can be determined. Good combinations- of 
properties will show up as peaks or minima, depending upon 
direction of definition, in the surface. That is, a very 
good combination of properties in property space will show 
up, for example, as a willingness to pay the fixed price for 
25 a small "amount" of the given vector of properties. Having 
defined a local "peak" in the indifference landscape surface, 
we can define the typical walk length, given step size, the 
peak, and the number of peaks to which one can walk from any 
point. In addition, we can examine the similarity of peaks 
30 climbed from the same or nearby points on the indifference 
landscape at a given price. He can ask if high peaks cluster 
40 j^ear one another. We can ask whether recombination is a good 

means to find the high peaks. If so. we can search out the 
high peaks by focusing our questioning in precise ways, to 
35 look "between" the high peaks on the current landscape, and 
45 hill cllnib from those points to etlll higher peaks. 
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All these properties allow focused sampling of the 
landscape to estimate the higher order context dependent, 
combinatorial features of a given market scape. 

Scacistical models of the sampled market ecape can 

5 be built by utilizing P epin-like models, where che class of 
models with all possible values of the coefficients of all 
the Padic terms in the polynomials constitute the family of 
landscape modele. Maximum entropy Bayesian updating similar 
techniques can then be used to estimate the most likely 

10 landscape parameters to fit the observed data. A major 
difference between the current approach and usual approaches 
is that the detailed sampling in specific regions of the 
indifference surface at a given price yields estimates of the 
how "high" the higher order terms, (K in the Nk model) 

15 actually are. Thus, we can estimate from such focused local 
measurements at several points on the landscape, thac, for 
example, fifth order interactions, P=5, are critical for 
determining the local structure of the market scape. Knowing 
that, we cart use a preponderance of the data to fit or 

20 estimate the S''^ order terms, and only a small amount of the 
data to estimate the monomial terms that may determine the 
overall non- isotropic features of the marketscape on long 
length scales across the marketscape. Thus, we can optimize 
use of the sampled data to discover both long range features 

25 of the landscape and local features. 

Given this analysis, one can derive a class of 
statistical models of the landscape, and specific models of 
the landscape. 

The method for predicting consumer demand 200 was 

30 explained in the context pt confuting an indifference surface 
for a predetermined price in the property space for a 
predetermined customer. However, as is known by one of 
ordinary skill in the art. the method for predicting consumer 
demand 200 could also be used to sample the property space of 

35 the product for a given class of customer at a predetermined 
price or at a set of predetermined prices. Further, the 
method for predicting consumer demand 200 could also be used 
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to arrange the demogtaphically characterized population of 
customers into a customer -scape for any given point in the 
product space, This new approach to market segmentation 
arises by casting the agents into an M dimensional 
5 demographic apace, ^t any given price, we can determine the 
fraction of customers in any small volume of demographic 
space who will buy the good at that point in product space at 
the given price. This determines a "customer- scape « for that 
good at thac price. Once again, the customer- scape is a 
10 landscape, and we can define all the properties noticed 
above: correlation structure, lengths of fixed step length 
walks to peaks, the dwindling cons uphill as peaks are 
climbed, the number of peaks accessible from a given point, 
the similarity of such peaks, and whether high peaks cluster 
3^5 near one another. In the latter case, recombination is a 
good means to search the landscape. This procedure defines 
one or more optimal customer features for a given good, or 
position in product space. The same procedure allows 
25 multiple points in product space to be utilized, indeed just 

20 the points normally utilised, to find the best set of 
positions in product space to match the best targeted 
populations of customers in customer space. Again, the 
advantage of our procedure is thac it allows the higher order 
terms, the context dependent features in customer space, to 
25 be more readily detected, for it tells us that K order terms 
are important. Again, we can then construct statistical 
models of customer- scapes, and models of specific customer 
scapes . 

The present invention further includes a framework 
30 for the marketing and introduction of novel products, which 
is a central function of businesses. FIG. 3 provides a flow 
40 diagram of the framework 300 for the marketing and 

introduction of novel products. The framework 300 concsms 
means to model customers and derive an optimal set of goods 
35 to produce alone or in the face of a coevolving cortroetltive 
environment where other firms are introducing and modifying 
their own gccds. 
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In step 302. th« framework for the marketing and 
introduction of novel products 300 assentoles data on 
cuetomers from statements of preferences on questionnaires, 
point of purchase data, nellson data, etc. In step 304, the 
5 framework for the marketing and introduction of novel 
products 300 creates a model of customer preferences. In 
step 306, the framework 300 uses the models of customer 
preferences created in step 304 to identify preferred goods 
and services. In step 308, the framework considers the 
10 behavior of other firms in the environment in addition to the 
models of customer preferences created in step 304 to 
Identify preferred goods and services in a coevolvlng 
cocrpetitive environment. 

PIO. 4 provides a flow diagram of the method for 
15 creating a model of customer preferences of step 304. In 
step 402, the method for creating a model of customer 
preferences 304 determines whether to perform market 
segmentation. If step 402 indicates that market segmentation 
should be performed, control proceeds to step 404 where the 
20 method for creating a model of customer preferences executes 
the adaptive dissimilarity partitioning method 100 shown by 
the flow diagram of FIG. 1. If step 402 indicates that 
narket segmentation should not be performed, control proceeds 

to step 406. 

in step 406, the method for creating a model of 
" customer preferences 304 constructs a family of linear or 
non-linear models of customers. These models are candidate 
•naps from answers to questions, point of purchase data. etc. 
to the actual predictive preferences of the customers for the 
'o goods in question. Accordingly, an aim of the method for 
creating a model of customer preferences 304 is to order the 
goods in a match to actual preferences of customers. 

in step 408, the method for creating a model of 
customer preferences 304 constructs agent based models of 
35 customers based on default hierarchies, rules of thumb, etc. 
in their strategy space. Default hierarchies, etc. do not 
require that preferences be transitive, which is often true 
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of custowera, In contrast, a preference space does require 
transitiviby. Agent based models of customere are described 
in A SyBtem and Method for the Synthesis of an Bconomia Web 
and the Identification of Wew Mar;cet Niches, Attorney docket 
5 number 9392-0007-999. filed May 15, 1998, the contents of 
which are herein incorporated by reference. Agent based 
models of cueromers are further described in An Adaptive and 
Reliable System and Method for Operations Management, 
Attorney docket number 9392-0004-999, filed July 1, 1999. the 
,0 contents of which are herein incorporated by reference. 

m step 410, the method for creating a model of 
customer preferences 304 utilizes adaptive algorithms over 
the space Of mappings produced by step 406 and the space of 
agent strategies produced by step 408 to find a set of models 
;iL5 that predicts customer purchasing preferences for a set of 
goods. In the preferred embodiment, the adaptive algorithms 
are genetic algorithms. In an alternate embodiment, the 
adaptive algorithms are genetic programming. 
25 In atep 412, the method for creating a model of 

20 customer preferences 304 determines whether the output of 
step 410 has produced good predictive models of customer 
purchasing preferences. If step 412 determines that the 
output of step 410 has not produced good predictive models of 
customer purchasing preferences, control returns to step 406 
25 where processing proceeds with the new set of models produced 
by the adaptive algorithm of step 410. If step 412 
determines that the output of step 410 has produced good 
predictive models of customer purchasing preferences, control 
proceeds to step 414 where the processing terminates, 

f^e previously discussed, in step 306, the framework 
300 uses the models of customer preferences created in step 
304 to identify preferred goods and services. If the 
customers have preferences for may features of a product that 
add up to a single preference landscape, then step 306 
35 executes the method for predicting consumer demand 200 

illustrated by the flow diagram of PIG- 2. In contrast, if 
the customer preferences are not commensurable, then step 306 
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executes aa optimization tool to find tha global pareto 
optimal pointB such as Configuration Sherpa, which is 
described in A System and Method for Coordinating Econontia 
Activities Within and Between Economic Agents. In either 
5 case, one of ordinary ekill in the art would understand that 
there are a variety of clustering and raulti-dimensional 
scaling algorithms that can seek optimal choices of locations 
of goods in the product space to attract the most customers. 
Such algorithms may prespecify the number of goods, or seek 
optimal numbers and locations of goods based on a firm's 
budget constraints, and other aspects of firm operations in 
its competitive environment. 

AS previously explained, in step 308, the framework 
considers the behavior of other firms in the environment in 
15 addition to the models of customer preferences created in 
step 304 to identify preferred goods and services in a 
coevolving competitive environment. Firms compete by 
introducing or improving products. Hence, there is a 
25 coevolutionary dynamic. Generically, there are two regimes t 

20 a **red queen" regime of persistent coevolution in the space 
of products and an evolutionary stable strategies regime 
where all products reach local or global Nash equilibria and 
30 stop moving in product space. Sec At Home in the Universe, 

If the firm con^letes the observe, orient, decide and act 
25 loop (OODA) faster than the other firms with respect to the 
introduction, innovation, improvement and wise placement of 
products, it can systematically win. 
^5 3tep 308 of the framework for the marketing and 

introduction of novel products 300 uses models of customers 
30 and capacity to predict preferences over ths space of 

products to build agent based or other dynamical models of 
the coevolution of market shares of products, utilizing data 
to locate optimal positions for new or in^roved products in 
coevolutionary dynamics subject to constraints on budget, 
35 capacity, and time to market for new or Improved goods, etc. 
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Agent baeed modelB that identify new products are described 
in A System and Method for the Synthesis of an Economic Web 
and Che Identification of New Market Niches. 

PIG. 5 discloses a representative computer syatem 
5 510 in conj'onction with which the embodiments of the present 
invention may be implemented. Computer system 510 may be a 
personal computer, workstation, or a larger system such as a 
minicomputer. However, one skilled in the art of computer 
systems will understand that the present invention is not 
10 limited to a particular class or model of computer. 

As shown in FIO. 5, representative cotnputer system 
510 includes a central processing unit (CPU) 512, a memory 
unit 514, one or more storage devices 516, an input device 
5ie, an output device 520, and communication interface 2922. 
15 A system bus 524 is provided for communications between these 
elements. Computer system 510 may additionally function 
through use of an operating system such as Windows, DOS, or 
UNIX. However, one skilled in the art of computer systems 
will understand that the present invention is not limited to 
2Q a particular configuration or operating system. 

Storage devices 516 may illustratively include one or 
more floppy or hard disk drives, CD-ROMs, DVDs, or tapes. 
Input device 518 comprises a keyboard, mouse, microphone, or 
other similar device. Output device 520 is a computer 
25 monitor or any other known computer output device- 
communication interface 522 may be a modem, a network 
interface, or other connection to external electronic 
devices, such as a serial or parallel port 

While the above invention has been described with 
30 reference to certain preferred embodiments, the scope of the 
present invention is not limited to these embodiments. One 
skill in the art may find variations of these preferred 
embodiments which, nevertheless, fall within the spirit of 
the present invention, whose scope is defined by the claims 
35 set forth below. 
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Claims 

1, A method for partitioning a space of data 
5 coTtiprising the steps of: 

choosing a plurality of dissimilarity measures; 

partitioning the space for each of said plurality 
of dissimilarity measures; 

evaluating said partitioning for each of said 
10 plurality of dissimilarity measures,- and 

selecting one or more of said dissimilarity 
measures on the basis of said evaluation. 

2. A method for partitioning a space as in claim 
3L5 1 further con^rising the steps of: 

performing at least one operation of said selected 
dissimilarity measures to generate a new plurality of 
dissimilarity measures; and 

repeating said partitioning the space step and said 
20 evaluating said partitioning step for each of said new 
plurality of dissimilarity measures; and 

selecting one or more of said new dissimilarity 
measures on the basis of said evaluation* 

25 3» A method for partitioning a space as in claim 

2 further comprising the step of iterating on said performing 
at least one operation of said selected dissimilarity 
measures step, said repeating said partitioning the space 
step and said selecting one or more of said new dissimilarity 

30 measures step to achieve an optimal partition. 

4, A method for partitioning a space as in claim 

3 wherein said at least one operation is a genetic operation. 

35 . 5. A method for partitioning a space as in claim 

4 wherein said genetic operation is selected from the group 
consisting of a mutation and a crossover. 
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5 6 A method for partitioning a space as in claim 

1 wherein said choosing a plurality of dissimilarity measures 
step comprises the steps of: 

choosing a family of dissimilarity measures; and 
^ randomly generating said plurality of dissimilarity 

measures from said chosen family* 

7 . A method for partitioning a space as in claim 
1 wherein said partitioning for each of said plurality of 

^5 dissimilarity measures step is perforrr^ed by at least one 

clustering algorithm. 

8. A method for partitioning a space as in claim 
7 wherein said at least one clustering algorithm is a k-msans 

15 clustering algorithm. 

9. A method for partitioning a space as in claim 
6 wherein said dissimilarity measure is a dissimilarity 
function and said family of dissimilarity measures is a 

20 family of dissimilarity functions. 

10. A method for partitioning a space as in claim 
9 wherein said family of dissimilarity functions for said 
space having two dimensions is: 
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wherein 

x-», x» x^, -and Kpare said data in said apace; 
30 f'and f,are polynomial functions of their variables; and 
f^and f, are characterized by a plurality of parameters. 

11. A method for partitioning a space as in claim 
10 wherein said polynomial functions are second degree 
35 polynomial functions. 
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12. A method for partitioning a space as in claim 
10 wherein eaid plurality of parameters are coefficienta of 
the polynomials. 

13 A method for partitioning a space as in claim 
12 wherein the variations of said plurality of parameters are 
restricted to [-10,101. 

14, A method for partitioning a space as in claim 
10 12 wherein said evaluating said partitioning for each of said 
plurality of dissimilarity measures step comprises the step 
of assigning a fitness to said each of said plurality of 
dissimilarity measures and said fitness is defined by: 

15 J . 



1+ 



25 



wherein: 

is the number of said data that are assigned to the same 
20 partition that do not belong in the same partition; and 

M«,,- is Che number of said data that are assigned to different 
' partitions and do belong^ in the same partition. 

15. A method for partitioning a space as in claim 
6 wherein eaid family of dissimilarity measures are general 
function approximators. 

16. A method for partitioning a space as in claim 
6 wherein said general function approximators are neural 

30 networks having connections weights. 

17. A method for partitioning a space as in claim 
1 wherein said evaluating said partitioning for each of said 
plurality of dissimilarity measures step is performed by a 

35 human observer. 
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5 X8. A iMttod for partitioning a spaca as in claim 

17 wherein said selecting one or more of said dissimilarity 
measures on the basis of said evaluation step is performed by 
a human observer based on said evaluation. 

19. Con^Juter executable software code stored on a 
computer readable medium, the code for partitioning a space 
of data, the code comprising: 

code to choosing a plurality of dissimilarity 

measures; 

code to partition the apace for each -of said 
plurality o£ dissimilarity measures; 

code to evaluate said partitioning for each o£ said 
plurality of dissimilarity measures; and 

. code to select one or more of said dissimilarity 
measures on the basis of said evaluation. 

20. computer executable software coda stored on a 
computer readable medium, the code for partitioning a space 

20 as in claim 19, the code further comprising: 

code to perform at least one operation of said 
selected dissimilarity measures to generate a new plurality 
of dissimilarity measures; 

code to repeat said partitioning the space step and 
25 said evaluating said partitioning step for each of said new 
plurality of dissimilarity measures; and 

code to select one or more of said new 
dissimilarity measures on the basis of said evaluation. 

21. A programmed con^uter system for partitioning 
a space comprising at least one memory having at least one 

40 storing computer executable program code and at least 

one processor for executing the program code stored in said 
memory, wherein the program coda includes: 

code to choosing a plurality of dissimilarity 

45 measures; 
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code to partition the* space for each of said 
plurality of dissimilarity measures; 

code to evaluate said partitioning for each of said 
piuraliry of dissimilarity measures; and 
5 code to select one or more of said dissimilarity 

meaaurea on the basis of said evaluation. 

22. A programmed computer system for partitioning 
a space comprising at least one memory having at least one 

10 region storing computer executable program code and a:: least 
one processor for executing the program code stored in said 
memory as in claim 21, wherein the program code further 
Includes : 

code to perform at least one operation of said 
15 selected dissimilarity measures to generate a new plurality 
of dissimilarity measures; 

code to repeat said partitioning the space step and 
said evaluating said partitioning step for each of ©aid new 
plurality of dissimilarity meae\ires; and 

code to select one or more of said new 
dissimilarity measures on the basis of said evaluation. 

23. A method for determining customer demand for 
products comprising the steps of: 

25 defining a space having R dimensions wherein each 

point in said space corresponds to a vector of properties) 

constructing a landscape for said space comprising 
the steps of; 

locating at least one point on said space 
30 where a predetermined customer would purchase a product 
having said corresponding vector of properties at a 
predetermined price; and 

sampling a set of points on an R-dimensional 
sphere surrounding said selected point at a predetermined 
35 step length from said selected point to determine a first 
subset of said set of points where the predetermined customer 
would make a purchase at said predetermined price and to 
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price. Mio ji^tJenncs surtax beWsm « 

price . 

24. A method for determining customer ^^^'^ ["^f 
products as in clai. 23 wherein said constructing a landscape 

far said space further comprises the steps of: 
0 for said ^^^^^ ,aid Indifference 



surface.- and^^^^^^^ ^^^^ ^^^^^^^ ^^^^ ^,,,,,3d 
20 point to extend said at least one indifference surface. 

" 25 A method for determining customer demand for 

products as in claim 24 further comprising the step of 
ttlrating on said selecting at least one point on said 
indi « n^e surface step and said repeating said sa«^ in^ 
20 3tep from said selected point step to further extend said 
indifference surface. 
30 26. A method for determining customer demand for 

said sampling step. 

3* 27 A method for determining customer demand for 

products as in claim 2e wherein said indifference surface 
30 characteristics comprise a degree of ruggedness. 

A method for determining customer demand for 
products as in claim 26 wherein said indifference surface 
Characteristics comprise at least one correlation length. 

A method for determining customer demand for 



40 23. 



29 A method tor aec.erinii"."^i — ^ 

products a, in claim 26 further comprising the step of 
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locating one or ntora points on said indifference surface 
having a sir^H amount of said corresponding vector of 
properties to identify peaks on said indifference surface. 

30. A method for deterttvining customer demand for 
product* as in claim 29 wherein said indifference surface^ 
characteristics further comprise at" least one typical walK 
length to said identified peaks. 

^5 A method for determining customer demand for 

products as In claim 29 wherein said indifference surface 
characteristics further comprise at least one clustering 
measure of said identified peaks. 

32. A method for determining customer demand for 
products as in claim 29 further comprising the steps of: 

defining a family of possible models to 
represent the customer demand; and 

selecting one or more models from said family 
20 of possible models that are compatible with said indifference 
surface charaoterl sties . 

33. A method for determining customer demand for 
products as in claim 32 wherein said selected models have a 

25 plurality of parameters. 

34. A method for determining customer demand for 
products as in claim 33 further cotiprising the step of 
determining values of said plurality of parameters for said 

30 selected models from said sampling step. 

^ 35. A method for determining customer demand for 

products as in claim 33 wherein said values of said plurality 
of selected parameters are determined using Bayesian 
35 analysis. 
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36. computer executable software code stored on a 

con^uter readaMe .edlu., the code for deterndning customer 

H^m;*nd for products, the code comprising; 
demand tor ^^^^^^ ^ ^^^^ ^^^^^^ ^ 

S each point in said space corresponds to a vector of 

propertiM^^^ to construct a leu-vdacape for said space 

comprising: ^^^^ ^^^^^^ ^^^^^ ^^.^^ 

space where a predeter^ned customer would purchase a product 
having said corresponding vector of properties at a 
predetermined price; and 

code to sample a set of points on an R- 
dimensional sphere surrounding said selected point at a 
,c predetermined step length from said selected point to 
" determine a first subset of said set of points where the 
oredetermined customer would maXe a purchase at said 
predetermined price and to determine a second subset of said 
sampled points where the customers would not make the 

a!e'a. said predetermined price, said ^i»t subset of 
'° ooints and said second subset of points form "^"^^J'" 
Indifference surface between a buying region and a non-buying 
region at said predetermined price. 

37 Computer executable software code stored on a 
" computer readable medium, the code for determining customer 
deld for products as in claim 3e, wherein ^aid cod o 
construct a landscape for said space further comprises, 
code to select ac least one point on said 

,0 indifference surface; and , 

code to repeat said sampling step from said 
selected point to extend said at least one indifference 

surface. 

38 computer executable software code stored on a 
computer readable medium, the code for determining customer 
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demand for products as In claim 37, the code further 
comprising:^^ to iterate on said selecting at least one 
point on said indifference surface step and said repeat ng 
, said san«>ling step from said selected point step to further 
extend said indifference surface. 

39 A progratiuned computer system for determining 
customer demand for products con^rising at least one memory 
,„ having at least one region storing computer executable 
" T:Z- code and at least one processor for executing the 
pr^ram code stored in said memory, wherein the program code 

code to define a space having R di««nsions wherein 
15 each point in said space corresponds to a vector of 
properties;^^^^ to construct a landscape for said space 

comprising: , . 

code to locate at least one point on said 
space Where a predetermined customer would purchase a product 
having said corresponding vector of properties at a 
predetermined price,- and 

code to sample a set of points on an R- 
dimensional sphere surrounding said selected point at a 
predetermined step length from said selected PO^-t to 

" determine a first subset of said set of point, where the 
predetermined customer would m^e a P"-'=''^^«/^ 
predetermined price and to determine a second subset of said 
sampled points where the customers would not make the 

30 ;rchase at said predetermined price, said ""^ ".ubset of 
Lints and said second subset of points form at least one 
^Idif ferle surface between a buying region and a non-buyxng 
region at said predetermined price. 

40 A programmed computer system for determining 
customer demand for products comprising at least one memory 
having at least one region storing computer executable 
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, ^ *t least one processor for executing the 

prog-^m - landscape for said space further 

said code to construct a xana»^<*if' 

5 code to select at least one point on said 

Indifference surface; and 

code to repeat said sanpUng step from said 
selected point to extend said at least one Indifference 
surface. 

^° 41 A programmed co«5)uter system for determining 

customer demand for ptoducts corr^rlsing at least one memory 
having at least one region storing computer executable 
program code and at least one processor for executing the 
p::^ram code stored in said memory as in claim 40. .herexn 
said code further con^jrisest , ^ ^ 

code to Iterate on said selecting at least one 
point on said indifference surface step and said -P-^ «^ 
Zi6 san^>Ung Step from said selected point step to further 
20 extend said indifference surface. 

42 A method for creating a model of consumer 
preferences from consumer data comprising the steps of, 

constructing a plurality of candidate maps form the 
,5 consumer data to actual consumer preferences, 

constructing a family of agent-based models, 
evaluating said plurality of candidate maps and 
said family of agent-based models «ith respect to said 

consumer data; , 

selecting one or more of said plurali-ty of 
candidate maps and said family of agent based models based on 

said evaluation; and ^-i^r^f^A 
performing at leas= one operation on saxd selected 

candidate maps and said selected agent-based ^'-"-^ 
35 generate a new plurality of candidate «.ps and a new family 

45 of agent -based models. 
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43. A method for creating a model of consumer 
preferences from consumer data as in claim 42 further 
con^rising the step of iterating on said evaluating said 
plurality of candidate maps step, said selecting one or more 
of said plurality of candidate maps and said family of agent 
based models step and said performing at least one operation 
on said selected candidate maps and said selected agent-based 
models step to achieve an optimal modal of consumer 
preferences. 

44. A method for creating a nftodel of consumer 
preferences from consumer data as in claim 42 wherein said 
least one operation is a genetic operation. 

45. A method for creating a model of consumer 
preferences from consumer data as in claim 44 wherein said 
genetic operation is selected from the group consisting of a 
mutation and a crossover. 

46. Con^^uter executable software code stored on a 
computer readable medium, the code for creating a model of 
consumer preferences from consumer data, the code comprising: 

code to construct a plurality of candidate- maps 
form the consumer data to actual consumer preferences? 
25 code to construct a family of agent-based models; 

code to evaluate said plurality of candidate maps 
and said family of agent-based models with respect to said 
consumer data; 

code to select one or more of said plurality of 
30 candidate maps and said family of agent based models based on 
said evaluation; and 

code to perform at least one operation on said 
selected candidate maps and said selected agent -based models 
to generate a new plurality of candidate maps and a new 
35 family of agent -based models. 
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47. A programmed cotnputer system for creating & 
model o£ consumer preferences from consumer data, coi^prislng 
at least one memory having at least one region storing 
computer executable program code and at least one processor 
5 for executing the program code stored in said memory, wherein 
the program code includes i 

code to construct a plurality of candidate maps 
from the consumer data to actual consumer preferences; 

code to construct a family of agent- based models; 
IQ code to evaluate said plurality of candidate maps 

and said family of agent-based models with respect to said 
consumer data; 

code to select one or more of said plurality of 
candidate maps and said family of agent based models baaed on 
15 said evaluation; and 

code to perform at least one operation on said 
selected candidate maps and said selected agent -based models 
to generate a new plurality of candidate maps and a new 
family of agent-based models. 

20 

4B. A method for marketing and introducing novel 
products from consumer data comprising the steps of: 

creating a model of customer preferences; and 
identifying novel products using the method for 
25 determining customer demand of claim 23, 

49. A method for marketing and introducing novel 
products from consumer data wherein said creating a model of 
customer preferences step is performing using the method of 

30 claim 1. 

50. Compucer executable software code stored on a 
computer readable medium, the code for marketing and 
introducing novel products from consumer data, the code 

35 comprising; 

code to create a model of customer preferences; and 
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5 code to identify novel products using the method 

for determining customer demand of claim 23, 

51. Computer executable software code stored on a 
g coTT^uter readable medium, the code for marketing and 

introducing novel products from consumer data as in claim 50, 
wherein said code to create a modal of customer preferences 
is the code of claim 19. 
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This International Searching Authority found multiple inventions in dns international application, as follows: 
Please Sec Extra Sheet. 



* • Ait all required additional search fees were timely paid by the applicant, this international search report covers all searchable 

claims. 

2, I I As &U searchable claims could be searched without effort justifyirig an additional fee. this Authority did not invHe payment 

of any additional fee. 

3. As only some of the required additional search fees were timely paid by the applicant, this inlcinational search report covers 
only those claims for which fees were paid, speciftcany claims Nos.: 



^' n required additional search fees were timely paid by the appUcanU Consequcnlly. this international search report is 
restricted to the invention Tirst mentioned *m the claims; it is covered by claims Nos.: 



Remark on Protest ^] The additional search fees were accompanied by the appUcam's protest. 

No protest accompanied the payment of additional search fees. 



Form PCT/lSA/210 (continuation of first shect(I))(July 1992)* 



INTERNATIONAL SEARCH REPORT 



[otemfttional appEcation No. 
PCT/US99/15236 



BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple tnventiona u foUowi: 

Tliis application contains the following inventions or groups of inventions which are not so linked as to form a single 
inventive concept under PCT Rule 13.1. In order for all inventions to be searched, the appropriate addUional search 
fees must be paid. 

Group I. claims 1-22, drawn to partitioning a space of data. 

Groiq> II. claims 23-41 , 48, and 50, drawn to determining customer demand for products. 
Group in, claims 42-47, drawn to creating a model of consumer preferences from consumer data. 



The inventions listed as Groups I, 11, and HI do not relate to a single mventivc concept under PCT Rule 13.1 because, 
under PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons: The 
three inventions are sub^mbinations \v1iich are Msab)c together, but may also function independently of each cAher. 



Form PCT/IS A/210 (extra shcct)(July 1992)* 



